Faster time-aligned phonetic transcriptions through partial automation

Authors

  • Ben Serridge
  • Luciana Castro
Abstract

A semi-automatic process for generating time-aligned transcriptions of speech data at the word and phone level is described. At each stage in the process, segment durations are estimated to generate approximate boundary markers, which are then corrected by hand. Corrections at one level are taken into account in the generation of boundaries for the next level, such that the error is reduced at each successive stage. A test implementation based on Praat was applied to a corpus of Brazilian Portuguese and a comparison against a fully manual process revealed a reduction of 54% in the time required to generate phonetic transcriptions and an average error of 21 ms in the time-alignment of phonetic boundaries.
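
The staged estimate-then-correct idea in the abstract can be illustrated with a small sketch. The abstract does not say how durations are estimated, so this sketch assumes simple proportional scaling of expected durations within a known span. The phone durations, the lexicon, the function names, and the simulated manual correction are all hypothetical, and nothing here uses Praat itself.

```python
from typing import Dict, List, Sequence, Tuple

Interval = Tuple[float, float, str]  # (start_s, end_s, label)


def estimate_boundaries(start: float, end: float,
                        labels: Sequence[str],
                        expected: Sequence[float]) -> List[Interval]:
    """Split [start, end] into one interval per label, sized in
    proportion to each label's expected duration."""
    if not labels or len(labels) != len(expected):
        raise ValueError("need one expected duration per label")
    total = sum(expected)
    if total <= 0 or end <= start:
        raise ValueError("durations and span must be positive")
    scale = (end - start) / total
    intervals: List[Interval] = []
    t = start
    for i, (label, dur) in enumerate(zip(labels, expected)):
        # Pin the last boundary to `end` to avoid floating-point drift.
        nxt = end if i == len(labels) - 1 else t + dur * scale
        intervals.append((t, nxt, label))
        t = nxt
    return intervals


# Hypothetical mean phone durations in seconds (illustrative values only).
MEAN_DUR: Dict[str, float] = {"k": 0.06, "a": 0.10, "z": 0.07, "u": 0.09, "w": 0.08}
# Hypothetical pronunciations for a two-word utterance.
LEXICON: Dict[str, List[str]] = {
    "casa": ["k", "a", "z", "a"],
    "azul": ["a", "z", "u", "w"],
}


def word_duration(word: str) -> float:
    """Expected word duration as the sum of its phones' mean durations."""
    return sum(MEAN_DUR[p] for p in LEXICON[word])


# Stage 1: approximate word boundaries over a 1-second utterance.
words = ["casa", "azul"]
word_tier = estimate_boundaries(0.0, 1.0, words, [word_duration(w) for w in words])

# Manual step (simulated): the transcriber moves the word boundary to 0.55 s.
corrected_words: List[Interval] = [(0.0, 0.55, "casa"), (0.55, 1.0, "azul")]

# Stage 2: phone boundaries are estimated inside each corrected word,
# so a phone boundary can never fall outside its word's corrected span.
phone_tier: List[Interval] = []
for w_start, w_end, word in corrected_words:
    phones = LEXICON[word]
    phone_tier += estimate_boundaries(w_start, w_end, phones,
                                      [MEAN_DUR[p] for p in phones])
```

Because each stage only redistributes time inside a span that has already been corrected by hand, an error at the word level cannot propagate past the word's edges. This is one plausible reading of how the error is "reduced at each successive stage".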

Similar resources

Automatic Phonetic Transcription of Large Speech Corpora

Most large speech corpora are delivered with a lexicon that contains a canonical transcription of every word in the orthographic transcription. Such a lexicon can be used for generating a hypothetical ‘canonical’ phonetic transcription from the orthography. In addition, time and money permitting, some speech corpora are provided with a manually verified broad phonetic transcription of at least ...

Application-oriented validation of phonetic transcriptions: preliminary results

There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually val...

Automatic phonetic transcription of large speech corpora

This study is aimed at investigating whether automatic phonetic transcription procedures can approximate manual transcriptions typically delivered with contemporary large speech corpora. To this end, ten automatic procedures were used to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues) from the Spoken Dutch Corpus. T...

Validation of phonetic transcriptions based on recognition performance

In fundamental linguistic as well as in speech technology research there is an increasing need for procedures to automatically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic generation of phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...

Publication date: 2008